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miss, in other words, before a cache miss occurs. As a result, 
during a cache miss, only a load penalty occurs, and the occurrence 
of a write back penalty can be reduced. With this, it is possible to 
improve cache memory efficiency and improve access speed. 
[0025] The processor does not need to perform line address 

management for the write back operation, and can specify an 
arbitrary address range regardless of the line size and line 
boundaries of the cache memory. 

Brief Description of Drawings 

[0026] FIG. 1 is a block diagram showing the outline structure of the 
system including the processor, cache memory, and memory, in the 
first embodiment of the present invention. 

FIG. 2 is a block diagram showing an example of the structure 
of the cache memory. 

FIG. 3 is a diagram showing the detailed bit structure of a 
cache entry. 

FIG. 4 is a block diagram showing the structure of the control 

unit. 

FIG. 5 is a block diagram showing an example of the structure 
of the C flag setting unit. 

FIG. 6A shows an example of an instruction for writing a start 
address into the start address register. 

FIG. 6B shows an example of an instruction for writing a size 
into the size register. 

FIG. 6C shows an example of an instruction for writing a 
command into the command register. 

FIG. 6D shows an example of a command. 

FIG. 7 shows an explanatory diagram for the start aligner and 
the end aligner. 

FIG. 8 is a flowchart showing an example of the C flag setting 
process by the flag rewriting unit. 



FIG. 9 is a flowchart showing an example of the cleaning 
process by the cleaning unit. 

FIG. 10 is a block diagram showing the structure of the cache 
memory in the second embodiment of the present invention. 

FIG. 11 is a diagram showing the bit structure of a cache 

entry. 

FIG. 12 is a block diagram showing the structure of the 
control unit. 

FIG. 13 shows an example of the use flag updating by the 
replacement unit. 

FIG. 14A is a diagram showing the replacement of cache 
entries in the case where a weak flag does not exist. 

FIG. 14B is an explanatory diagram showing the role of the 
weak flag W in the replacement process. 

FIG. 15 is a flowchart showing the W flag setting process by 
the W flag setting unit. 

FIG. 16 is a flowchart showing the cleaning process by the 
cleaning unit. 

FIG. 17 is a flowchart showing the U flag updating process by 
the replacement unit. 

FIG. 18 is a flowchart showing the replacement process by the 
replacement unit. 



Numerical References 

[0027] 1 Processor 

2 Memory 

3 Cache memory 

20 Address register 

21 Memory I/F 
30 Decoder 
31a to 31d Ways 

32a to 32d AND circuits 



34 OR circuit 

35 Selector 

36 Selector 

37 Demultiplexer 
5 38 Control unit 

39 Cleaning unit 

40 C flag setting unit 

41 Replacement unit 
131b to 131d Ways 

10 138 Control unit 

139 Cleaning unit 

140 W flag setting unit 

401 Command register 

402 Start address register 
15 403 Size register 

404 Adding unit 

405 Start aligner 

406 End aligner 406 

407 Flag rewriting unit 

20 

Best Mode for Carrying Out the Invention 

[0028] (First Embodiment) 

<Overall structure> 

FIG. 1 is a block diagram showing the outline structure of a 

25 system including a processor 1, a cache memory 3, and a memory 2 
in the first embodiment of the present invention. As shown in the 
diagram, the cache memory 3 in the present invention is included in 
a system having the processor 1 and the memory 2. 
[0029] The cache memory 3 holds, for each cache entry which 

30 holds cache-unit data (called line data), a caching termination 
attribute indicating whether or not caching of the cache entry is 
allowed to be terminated; selects, regardless of whether or not a 
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cache miss occurs, a cache entry having a caching termination 
attribute indicating that caching is allowed to be terminated, and 
which is set with a dirty flag indicating that it has been written into 
by the processor; and writes back, to the memory, data of such 
5 selected cache entry. Here, the caching termination attribute 
indicating that caching is allowed to be terminated is added to a 
cache entry which is hereafter not likely to be written into, a cache 
entry which is hereafter not likely to be read/written into, and the 
like. 

10 <Structure of the cache memory> 

[0030] Hereinafter, the structure in the case where the present 

invention is applied to a 4-way set-associative cache memory shall 
be described as a specific example of the cache memory 3. 

FIG. 2 is a block diagram showing an example of the structure 

15 of the cache memory 3. As in the diagram, the cache memory 3 
includes an address register 20, a memory I/F 21, a decoder 30, four 
ways 31a to 31d (hereinafter abbreviated as ways 0 to 3), four 
comparators 32a to 32d, four AND circuits 33a to 33d, an OR circuit 
34, selectors 35 and 36, a demultiplexer 37, and a control unit 38. 

20 [0031] The address register 20 is a register which holds an 

access address to the memory 2. Such access address is assumed 
to be of 32 bits. As shown in the same diagram, the access address 
includes, in order from the highest bit, a 21-bit tag address, a 4-bit 
set index (SI in the diagram), and a 5-bit word index (WI in the 

25 diagram). Here, the tag address indicates an area (with size being, 
the number of sets multiplied by a block) within the memory which 
is mapped by a way. The size of this area is the size defined by 
address bits (A10 to AO) which are lower than the tag address, in 
other words, 2k bytes, and is also the size of one way. The set 

30 index (SI) indicates one of a plurality of sets which straddle the ways 
0 to 3. As the set index is 4 bits, the number of sets is sixteen sets. 
A cache entry which is identified by the tag address and the set 
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index is the unit for replacement, and is referred to as line data or a 
line when stored in the cache memory. The size of line data is the 
size defined by the address bits lower than the set index, in other 
words, 128 bytes. With one word being 4 bytes, one line data is 32 
5 bytes. The word index (WI) indicates one word within plural words 
making up the line data. The lowest 2 bits (Al, AO) within the 
address register 20 are disregarded during word accessing. 
[0032] The memory I/F 21 is an I/F for accessing the memory 

2 from the cache memory 3 such as in writing back data from the 
10 cache memory 3 to the memory 2, and loading data from the 
memory 2 to the cache memory 3. 

[0033] The decoder 30 decodes the 4 bits of the set index, and 

selects one set among the sixteen sets spanning the four ways 0 to 
3. 

15 [0034] The four ways 0 to 3 are four ways having the same 

structure and a capacity of 4x2k bytes. Each way includes sixteen 
cache entries. 

[0035] FIG. 3 shows the detailed bit structure of a single cache 

entry. As in the diagram, a cache entry has valid flags VO to V3, a 
20 21-bit tag, a 128-byte line data, a caching termination attribute (C 

flag, in the diagram), and dirty flags DO to D3. 

[0036] The tag is a copy of a 21-bit tag address. 

The line data is a copy of 128-byte data within a block 

identified by the tag address and the set index, and is made up of 
25 four 32-byte sub-lines. 

[0037] The valid flags V0 to V3 correspond to the four 

sub-lines, and indicates whether or not a respective sub-line is 

valid. 

[0038] The caching termination attribute (cleaning flag C) 

30 indicates whether or not caching is allowed to be terminated. It 
represents, for example, whether or not writing will be performed 
hereafter on the cache entry. C = 0 means that there is a possibility 
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that writing will be performed hereafter. C=l means that writing 
will not be performed hereafter, and means that, when such cache 
entry is dirty, cleaning (write back) should be carried out. 
[0039] The dirty flags DO to D3 correspond to the four 

5 sub-lines, and indicate whether or not a respective sub-line has 
been written into by the processor, in other words, whether or not a 
write back is necessary as, although cached data exists within a 
sub-line, such data is different from the data within the memory as 
a result of the write operation. 

10 [0040] The comparator 32a compares whether or not the tag 

address within the address register 20 matches the tag of the way 0 
among the four tags included in the set selected according to the set 
index. Aside from corresponding to the ways 31b to 31d 
respectively, everything else is the same for the comparators 32b to 

15 32d. 

[0041] The AND circuit 33a compares whether or not the valid 

flag matches the comparison result from the comparator 32a. The 
result of this comparison shall be assumed to be hO. In the case 
where the comparison result hO is 1, this means that there exists 

20 line data corresponding to the tag address within the address 
register 20 and the set index, in other words, a hit has occurred in 
the way 0. In the case where the comparison result hO is 0, this 
means that a mishit has occurred. Aside from corresponding to the 
ways 31b to 31d respectively, everything else is the same for the 

25 AND circuits 33b to 33d. Respective comparison results hi to h3 
represent a hit or miss occurring in the ways 1 to 3. 
[0042] The OR circuit 34 carries out the OR for the comparison 

results hO to h3. The result of this OR is assumed as a hit. A hit 
indicates whether or not a hit occurs in the cache memory. 

30 [0043] The selector 35 selects, among the line data of the 

ways 0 to 3 in the selected set, the line data of the way in which a hit 
occurs. 



[0044] The selector 36 selects, from within the 32-word line 

data selected by the selector 35, one word which is indicated by the 
word index. 

[0045] The demultiplexer 37 outputs write data to one of the 

ways 0 to 3, during the writing of data into the cache entry. The 
write data may be in 1-word units. 

[0046] The control unit 38 performs the overall control of the 

cache memory. In particular, it performs the setting of a C flag and 
the cleaning (write back) in accordance with such C flag. 
<Structure of the control unit> 

[0047] FIG. 4 is a block diagram showing the structure of the 

control unit 38. As in the diagram, the control unit 38 includes a 
cleaning unit 39 and a C flag setting unit 40. 

The cleaning unit 39 searches for a cache entry which is set 
with C=l, and performs a write back when such cache entry is dirty. 
[0048] The C flag setting unit 40 sets a cleaning flag C in 

accordance with a command from the processor 1. The processor 1 
issues, to the cache memory 3, a command instructing the setting of 
a cleaning flag to a cache entry which will no longer be written into. 
<Structure of the C flag setting unit> 

[0049] FIG. 5 is a block diagram showing an example of the 

structure of the C flag setting unit 40. As in the diagram, the C flag 
setting unit includes a command register 401, a start address 
register 402, a size register 403, an adding unit 404, a start aligner 
405, an end aligner 406, and a flag rewriting unit 407. 

The command register 401 is a register that allows direct 
access from the processor 1, and holds a C flag setting command 
written by the processor 1. FIG. 6C shows an example of an 
instruction to write a command in the command register 401. This 
instruction is a normal transfer instruction (mov instruction), and 
specifies the command as a source operand and the command 
register (CR) 401 as the destination operand. FIG 6D shows an 
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example of a command. This command is a specific code indicating 
a C flag setting command. The C flag setting command is a 
command which instructs the setting of C flags to cache entries 
which hold the data corresponding to an address range which starts 
from the start address held in the start address register 402 and 
which has the size held in the size register 403. 
[0050] The start address register 402 is a register which 

allows direct access from the processor 1, and holds a start address 
written by the processor 1. This start address indicates the starting 
position of an address range on which the C flags should be set. 
FIG. 6A shows an example of an instruction for writing a start 
address into the start address register 402. As in FIG. 6C, this 
instruction is also a normal transfer instruction (mov instruction). 
[0051] The size register 403 is a register which allows direct 

access from the processor 1, and holds a size which is written by the 
processor 1. This size indicates an address range starting from the 
start address. FIG. 6B shows an example of an instruction for 
writing a size into the size register 403. As in FIG. 6C, this 
instruction is also a normal transfer instruction (mov instruction). 
Moreover, the size unit may be a number of bytes or a number of 
lines (number of cache entries), as long as it is a unit that is fixed in 
advance. 

[0052] The adding unit 404 adds the start address held in the 

start address register 402 and the size held in the size register 403. 
The result of this addition is an end address indicating an ending 
position of the address range. The adding unit 404 adds byte 
addresses when the size is specified as a number of bytes, and adds 
line addresses when the size is specified as a number of lines. 
[0053] The start aligner 405 adjusts a start address to the 

position of a line boundary. With this adjustment, the processor 1 
can specify an arbitrary address as the start address, regardless of 
the line size and the line boundary. 
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[0054] The end aligner 406 adjusts an end address to the 

position of a line boundary. With this adjustment, the processor 1 
can specify an arbitrary size for the aforementioned size, regardless 
of the line size and the line boundaries. 

[0055] FIG. 7 shows an explanatory diagram for the start 

aligner 405 and the end aligner 406. In the diagram, the start 
address specified by the processor 1 indicates an arbitrary position 
midway through a line N. The start aligner 405 makes an 
adjustment so that the beginning of the next line (N+l) is indicated, 
and outputs the adjusted address as an aligned start address. The 
line indicated by the aligned start address is called a start line. 
[0056] Furthermore, the end address indicates an arbitrary 

position midway through a line M. The end aligner 406 makes an 
adjustment so that the beginning of the immediately preceding line 
(M-l) is indicated, and outputs the adjusted address as an aligned 
end address. A line indicated by the aligned end address is called 
an end line. 

[0057] In this case, C flags are set to respective lines (cache 

entries) from the start line (line (N + l)) to the end line (line (M-l)). 
In this manner, the start aligner 405 and the end aligner 406 align 
inwards of the address range, from the start address to the end 
address, specified by the processor 1, as there is a possibility that a 
write operation by the processor may occur on the outer portions of 
the line N and the line M. 

[0058] The flag rewriting unit 407 sets, to 1, the C flags from 

the line indicated by the aligned start address to the line indicated 
by the aligned end address (in the example in FIG. 7, from the line 
(N + 1) to the line (M-l)), when corresponding data is in the cache 
memory 3. 

<C flag setting process> 

[0059] FIG. 8 is a flowchart showing an example of the C flag 

setting by the flag rewriting unit 407. 
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In the case where a C flag setting command is held in the 
command register 401, the flag rewriting unit 407 performs a loop 1 
process while sequentially outputting the respective line addresses 
from the start line to the end line (S82 to S86). Here, as the flag 
5 rewriting unit 407 performs the same process for each line, the 
process for only one line shall be discussed. 

[0060] More specifically, while the cache memory 3 is not 

accessed by the processor 1, the flag rewriting unit 407 outputs a 
line address to the address register 20 (S83); causes the 

io comparators 32a to 32d to compare the tag address in the address 
register 20 and the tag of the cache entries; and judges whether or 
not there is a hit (S84). In addition, in the case of a hit, the flag 
rewriting unit 407 sets 1 to the C flag of the cache entry where the 
hit occurred (S85) and, in the case of a mishit, nothing is done as 

15 corresponding data is not in the cache memory 3. 

[0061] With this, 1 is set in the C flags of each of the lines from 

the start line to the end line, in the case where corresponding data 
is in the cache memory 3. 
<Cleaning process> 

20 [0062] FIG. 9 is a flowchart showing an example of the 

cleaning process by the cleaning unit 39. 

As in the diagram, by sequentially specifying set indices (SI) 
0 to 15 (S901) in the loop 1 processing (S900 to S913), the cleaning 
unit 39 performs loop 2 processing on all the 16 sets. By reading 

25 the respective C flags of the ways within a set (S903) in the loop 2 
processing (S900 to S913), the cleaning unit 39 searches for a cache 
entry having C=l (S904). In loop 3 processing (S905 to S910), the 
cleaning unit 39 reads, on a per sub-line basis, dirty flags of the 
cache entry having C=l (S906), and when dirty (S907), writes back, 

30 to the memory 2, the data of such sub-line (S908) and resets the 
dirty flag of such sub-line to 0 (S909). In the write back of such 
sub-line data, the cleaning unit 39 writes back, in an idle cycle 
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(S920), one word at time (S922), as in loop 4 processing (S920 to 
S923). 

[0063] In this manner, the cleaning unit 39 sequentially 

checks the C flags of all the cache entries, searches for a cache entry 
5 having C=l, and, when dirty, writes back such cache entry from the 
cache memory to the memory 2. 

[0064] As described thus far, according to the cache memory 

in the present embodiment, a caching termination attribute (C flag) 
indicating whether or not caching is allowed to be terminated is 

io added to a cache entry belonging to an address range specified by 
the processor 1, and a cache entry which will not be written into is 
written back before the occurrence of a cache miss. Therefore, 
during a cache miss, only a load penalty occurs, and the occurrence 
of a write back penalty can be decreased. With this, it is possible to 

15 improve the efficiency of the cache memory, and improve access 
speed. 

[0065] Furthermore, the processor can specify arbitrary 

addresses as the address range to which caching termination 
attributes are set, regardless of the line size and line boundaries of 
20 the cache memory. With this, the need for a programmer and 
compiler to be conscious of the line size and line boundaries is 
eliminated and, therefore, cache address management with respect 
to cleaning need not be performed, and program creation can be 
made easier. 

25 [0066] In addition, since only a dirty sub-line is written back 

on a sub-line basis in the cleaning process, high-speed write back 
can be facilitated as compared to when write back is performed on a 
per line basis. 
<Variations> 

30 [0067] Note that the cache memory in the present invention is 

not limited to the structure in the aforementioned embodiment, and 
various types of variations are possible. Hereinafter, a number of 
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variations shall be described. 

[0068] (1) In FIG. 5, the C flag setting unit 40 may include an 

end address register instead of the size register 403 and the adding 
unit 404. In this case, the end address register holds an end 
address written by the processor 1. 

[0069] (2) The processor 1 may execute a store instruction for 

writing data while adding a caching termination attribute. In 
addition, the control unit 38 may include an instruction detection 
unit which detects the store instruction for writing data while adding 
a caching termination attribute, and a flag setting unit which sets 
C = l during the writing in accordance with such store instruction. 
[0070] (3) The respective instructions shown in FIG. 6A, 6B 

and 6C may be inserted, by a compiler, within a program. At that 
time, the compiler may insert the above-mentioned respective 
instructions in a position in the program which will not be performed 
of further write operations such as the writing of array data, the 
writing of block data during the decoding of compressed video data, 
and the like. 

[0071] (4) It is also possible to have a structure in which the 

cache entry does not hold a C flag. Stated differently, the C flag 
setting process shown in FIG. 8 and the cleaning process shown in 
FIG. 9 may be performed at the same time. In this case, in place of 
setting a C flag to a cache entry in S85 in FIG. 8, the loop 3 (S905 
to S910) in FIG. 9 may be executed. 

[0072] (5) Although in the aforementioned embodiment, 

description is made exemplifying a 4-way set-associative cache 
memory, the number of ways may be 8 ways or 16 ways. 
Furthermore, although in the aforementioned embodiment, an 
example having 16 as the number of sets is described, it is possible 
to have any number of sets. 

[0073] (6) Although in the aforementioned embodiment, 

description is made exemplifying a set-associative cache memory, it 
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ls possible to have a ful.y associative cache memory. 

(Second Embodiment) reaarding a 
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^.^rr^ ------ 

structure of the Cache memory ^ ^ ^ ^ ^ 

° 100741 "in he second embodiment of the present invention, 
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The cache memory in the diagram s d.ffe en ^ ^ 

struc ture in FIG. 2, in induding ways 131. « » he 
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15 control unit 38. Herema. , ^hich are the same, 
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■ : do i:,Tn ea e ch o! a tn: cache entries. The same is true for the ways 
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data, a week flag W, a use nag rporesen t s whether or not 

[0077] Among these, the weak nag <<%™* from the 

r nc^rip with regard to acce^ 

26 there will be any further usage w.t 9 , in the 

cache memory, the weakest j manner sinC e the weak 

evicted before other cache entrie, n — q Qf 
flag W has two meanings, it is rererreo 
30 cleaning and replacement ^ ^ ^ hgs been 

[0 078] The use flag U .nd.ca replacement due to a 

an access to the cache entry, and .s used, 
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m.shU using the LRU scheme, In place of access o e ^ 
the cache entries of the four ways. More accurately, a 1 m the use 
ag U means an access has been made, and a 0 In the use Hag 
means no access has been made. However, when the use 
5 the four ways within a set all become 1, they are reset too. Stated 

d fferently. the use flag U Indicates the two relative states of 
whether t e time of access Is o,d or new. In other words, ,t means 
That a cache entry with a use flag as 1 has been accessed more 
recently than a cache entry having a use flag as 0. 
10 f0079] The control unit 138 is different, compared to the 
ontro unit 38, in that it sets a W flag instead o, the C flag and ,n 
that it uses a use Hag U in place of the access order information for 
the LRU scheme. 
<Structure of the Control unit> 
15 ,0080) FIG. 12 is a block diagram showing the structure of the 
ontro, unit 138. The control unit 138 in the diagram is , ferent 
compared to the control unit 38, in Including a cleaning un, 139 and 
a W flag setting unit 140 in place of the cleaning un,t 39 and the C 
f,ag setting unit 40 respectively, and in having a replacement unit 41 

20 The cleaning un.. 139 .. different, compared to the 

cleaning unit 39, in that it refers to the W flag instead of the ^ 
As a result, write back is performed on a cache line which has W-l 

and which is dirty. 

25 [0082] The W flag setting unit 140 sets a weak flag W 
ccord.ng to a command from the processor 1. The processor 1 
issues to the cache memory 3, a command instructing the sett.ng of 
a weak flag to a cache entry which will no longer be used (wr te i and 
rea d) A cache entry having W-l. when dirty, becomes the subjec 

ao for cleaning and. in addition, is the first subject for replacement 
when a cache miss occurs. 

[0083] The replacement unit 41 performs replacement dunng 
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Hinn to a oseudo-LRU scheme with the use flag U 
a cache miss, accordmg to a P»" yse y when the 

serving as the access order and upda J , cache 

cache memory is accessed^ in he^r ^ ^ ^ ^ ^ 

entry having W=l is tne very 
replacement. 

description of the use flag U> the 
[00841 FIG. 13 shows an example of use f ag P 
replacement unit 4, The upper s ag — stag ; g ^ n 
sta9 e in the diagram repres t four che ^ ^ ^ 

„ which straddles the ways 0 to 3. ™ 1 0 Tne four use flags 

four cache entries are respective use flag values. 

U are written as UO to U3. 

m the too stage of the diagram, (UO to UJ) u, 
[0085] m the top stay respective cache 

L uuo /J *.u Q rar hp entrv of the way o 

,„ the diagram, when a hit occurs m the each, ent y = 

- w av 3 , s updated ^ . : , r rr . 

shown hy the hro.en ™"?™Z^^ *« the 

srsscri- - 

E ! of a cache miss, the replacement unit 4! determ.nes the cache 
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entry to be the subject for replacement based on the use flags, and 
performs the replacement. For example, the replacement unit 41 
determines either one of the way 1 and the way 3 to be the subject 
for replacement in the upper stage in FIG. 5; determines the way 3 
as the replacement subject in the middle stage in FIG. 5; and 
determines any one of the ways 0 to 2 to be the replacement subject 
in the lower stage in FIG. 5. 
<Description of the Weak flag> 

[0089] FIG. 14A is a comparative example for when a weak 

flag is assumed not to exist, and is a diagram showing the 
replacement of a cache entry. The diagram also shows, as in FIG. 
13, the four cache entries making up the set N which straddles the 
ways 0 to 3. The 1 or 0 at the right end of the four cache entries are 
the values of the respective use flags. Furthermore, it is assumed 
that only data E is seldom accessed data, and data A, B, C, and D are 
frequently accessed data. 

[0090] In the situation in the first stage of FIG. 14A, a cache 

miss occurs when the processor 1 accesses the data E. Due to this 
cache miss, for example, among cache entries having U = 0, the 
cache entry of the frequently accessed data C is replaced with the 
seldom accessed data E, and the situation in the second stage is 
arrived at. 

[0091] In the situation in the second stage, a cache miss 

occurs when the processor 1 accesses the data C. Due to this cache 
miss, the cache entry of the frequently accessed data D, which is a 
cache entry having U = 0, is replaced with the frequently accessed 
data C, and the situation in the third stage is arrived at. 
[0092] In the situation in the third stage, a cache miss occurs 

when the processor 1 accesses the data D. Due to this cache miss, 
for example, the cache entry of the frequently accessed data C is 
replaced with the frequently accessed data D, and the situation in 
the third stage is arrived at. 
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c *h ^ a ne the seldom used data E is 
memory. 5 since the seldom used data 

100945 j;< r-oTS^ r. 5 '*. - — ^ 

5 E is the oldest (U-u), 

and is evicted. pseudo-LRU scheme (likewise, in 

[00953 in this manner m the pseu ^ , cache 

he normal LRU scheme), fQur ways , due t „ the 

miss is induced four times when there 

- - ld ° m is an explanatory diagram showing the role of 

Tewea* f lag W in the 14B (same as the first 

In the situation in the , hrst stage m ^ 1 

sta ge in FIG. 14A. a cache m s oca amQng 
„ accesses the data |>«' » ^ entry of the freguentiy 
the cache entries having U-0. the ^ ^ £ At 

accessed data C is 4 sets a weak flag W as L 

thl s time, it is — ed ^ th this , the cache entry of the data 

„ - «- - the situat,on ' 

the second stage is arrived at a cache miss 

10097, m the situat.c, , n he sec ^ ^ ^ ^ ^ cache 
occurs when the processor 1 acces ^ |s the 

m iss, the cache entry of J. ; * suh.ect for replacement, 

- r - - c - and the situat,on 

stage 3 Is arrived at. wegk f|ag# jt is possible 

r:.. «r=^-- i - — — — 

30 data. 

<w flag -"'^^lowchart showing the W flag setting by the 
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e of the W flag setting unit 140 
w flag setting unit 140. The structur 5 and , , n addition, 

I the same as the C flag setting ^ ^iZ 1 is also the same as 
the W flag setting command from the P sets the addre ss 

[0X00, FIG. 15 is d,fferent compa ^ ^ ^ as t „ e 

step S1 8S in Piace o f the ; s ep 8, ^ ^ ^ 

cw :::;:^ f i a9 « : -r 

cache entry in « W c h a h , occurs, accords to S84. 

<C,eaning process> hart ^ cleanlng by the 

toioi] FIG - lb 11 

cleaning unit 139. rnmM red to FIG. 9, in having S903a, 

Th e diagram is different, ~™£ red and S9U . As aU of 
S904a, and S9U. in £=- °» 90 ^ ^ havlng 

rr^driis.dLcriptionshai, he omitted. 

i0 <u flag updating process> ^ y f|ag updat ,n« by 

[0102] FI G. 17 ,s a «- cha , „ assume d that the use 

0- „ , rache hit occurs (Step S61). the 

in the diagram, when a each ^ ^ ^ jn the set 

repl acement unit 41 sets to i t ^ occurred (5tep 

selected according to the - deX ' the ways in suc h set (Step 

562) , reads the use flags U of the re ^ y ^ jndjcate , 

563) , and iudges whether or not the ^ 
(Step S64). When ... do „■* nd ca * 1 ^ ^ ^ ^ ^ ways 
and, when all indicate 1, all the 
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are 'reset to 0 (Step S65). 
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nnpr the replacement unit 41 updates the 

°" , " < "°' 

14A and 14B. 

< Replacement process> ^placement process 

[0104] HG. 18 « - joweh-rt J when a m . |ss occurs jn 

by the replacement unrt 41. In unjt 41 rea ds the use 

a m emory access ^ four ways ,„ the set selected 

flags U and the weak « whetner or not a 

according to the set r*^ >• ft „ judged that there Is 

n0 „ ay having W-l, one w Y havjng u$e f , ags as 0 , 

At this time, when there « a plu ahty of V Qre _ when 

the placement ^™^ ^ e way having W = l is 
it is judged that a way having w At this time, 

placement, with the cache er , r o * ^ the use 
!0 tne subject (Step S96) and, after rep ^ ^ x 

flag U and the weak flag W of sue ^ ^ y ^ 

respectively (Step S97). Moreov r at ^ 

lhe dirty flag D are i-^ having w = 1 does not exist, 

among c 3 che entries of weys having W . o or 1. 

for tepiecement, regardless of whether h ^ ^ ^ 

due to seldom accesseu u 
shown in FIG. 14A and 14B. 
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writ e penalties at the t.me of a each ^ ^ ^ ^ 

, [01091 ^77^";^ ha v,n 9 W-l selected first 
no longer be used, and the cache e , y inducement 

as th e replacement '^^Jwng left in the cache 

of cache misses due to seldom accessed 

memory. f , beco mes sufficient as 

0 [0110] Furthermore, s.nce a 1 tat t 9 scneme 
Less order data through the adopt on P ^ ^ 

us ,ng a 1-bit use flag in place »« * can be made 

tne conventional LRU scheme a dware d ^ ^ ^ 

smaller as the data volume of the access 
15 updating is simple. 

<Variations> „„„ rv in the present invention is 

[01111 Note that the ^^Z ^Z embodiment, and 
not limited to the structure m the afore- ^ 
various types of variations are possible. 
20 variations shall be described. ^ ^.^ jn p , ace 

[0112] (1) « '« P° sslb,e t0 ha n V6 t a he use flag o, order data 
ot th e pseudo-URU scheme us,n the u ^ 
indicating the access order for the four ways ^ 
■ a per cache entry basis, and th« = sutae for ep ^ ^ ^ 

- :::: s - - — — 

regardless of the access order ^ whjch bQth 

/•^ n iq also possible to nave a bu u 
[0113] (2) " s s ° P embodime nt and the W flag m the 
the C flag shown in the fust emb it is p0SS ible to use 

3„ present embodiment are provded. in h s ^ ^ 

the C flag in the cleaning process a d us ^ q ^ 

selecting the subject for replacement. In so do 
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. cache entry with the P oss T - : ;rr d rr; r 

lon ger be written into, can be innaer be read from nor written 
Ceaning. A cache entry that w In o nge > replacemen , 
■ mt0 is Ceaned and seiected f.rst .. the J ^ ^ 

6 With the combined use of the C nag 

control can be performed. fnrpmenti0 ned embodiment, the 

[0U4] (3) Although in e o m - ^ ^ ^ ^ 
slze of a sub-Hoe is made out to be 1/ ^ ^ ^ 

flags as the number of sub-lines. 

industrial Applicability a s a cache memory for 

[0U5] The present invents ,s su, ab, as a ^ 

u facilitating high-speed memory »«^ cachfi memory , a d ata 
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